CDS
Accession Number | TCMCG075C25076 |
gbkey | CDS |
Protein Id | XP_017983021.1 |
Location | complement(join(721826..722142,722236..722402,722543..723345,723508..723732,724074..724211,724305..724389,724486..724544,724669..724890,725029..725100,725307..725419,726077..726242,726343..726462,726683..726832,726944..726976)) |
Gene | LOC18587691 |
GeneID | 18587691 |
Organism | Theobroma cacao |
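The Location field above uses the GenBank-style `complement(join(...))` notation: 14 coordinate ranges on the reverse strand that are spliced together to form the CDS. As a rough consistency check, the sketch below sums the segment lengths and compares the result with the protein length reported in this record. The helper names `parse_location` and `spliced_length` are illustrative, not part of any library, and the regular expression only handles simple `start..end` ranges (no fuzzy `<` or `>` boundaries).

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(721826..722142,722236..722402,722543..723345,"
    "723508..723732,724074..724211,724305..724389,724486..724544,"
    "724669..724890,725029..725100,725307..725419,726077..726242,"
    "726343..726462,726683..726832,726944..726976))"
)


def parse_location(location):
    """Return (strand, [(start, end), ...]) for a simple location string.

    Hypothetical helper: assumes plain 1-based inclusive 'start..end' ranges.
    """
    strand = "-" if location.startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments


def spliced_length(segments):
    """Total length of the joined segments (coordinates are inclusive)."""
    return sum(end - start + 1 for start, end in segments)


strand, segments = parse_location(LOCATION)
total = spliced_length(segments)

# Codons in the CDS, minus one for the terminal stop codon.
print(strand, len(segments), total, total // 3 - 1)
# prints: - 14 2670 889
```

The 2670 nt total corresponds to 890 codons, that is 889 amino acids plus a stop codon, which agrees with the 889aa length listed in the Protein section.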
Protein
Length | 889aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018127532.1 |
Definition | PREDICTED: protein RRC1 isoform X6 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | A |
Description | U2 snRNP-associated SURP motif-containing |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE | ko00000, ko00001, ko03041 |
KEGG_ko | ko:K12842 |
EC | - |
KEGG_Pathway | ko03040, map03040 |
GOs | GO:0005575, GO:0005622, GO:0005623, GO:0005634, GO:0005654, GO:0031974, GO:0031981, GO:0043226, GO:0043227, GO:0043229, GO:0043231, GO:0043233, GO:0044422, GO:0044424, GO:0044428, GO:0044446, GO:0044464, GO:0070013 |
Sequence
CDS: ATGGCAGCCAAGGGAAAGGAATCTGAGAAGAAGGAGGAGGAAAGGCTGAAGGAGAAGGAGAAGGGAAAGTCTCGAAACATTGATAATTTTATGGAGGAGCTGAAGCATGAACAAGAGATGAGGGAGAGGAGAAATCAGGAACGTGAACATTGGCGTGATGGGCGTCATACTGACAGTTCTGCTCCATCCAGTCGGTTTGATGAGCTGCCTGATGATTTTGATCCAAGTGGAAAACTGCCTGGATCATTTGATGATGGTGATCCTCAAACAACAAATCTCTATGTTGGAAATCTGTCACCAAAGGTTGATGAAAATTTTCTTCTGCGAACTTTTGGAAGATTTGGGCCTATTGCTAGTGTGAAGATTATGTGGCCTAGGACAGAGGAGGAGCGAAGACGGCAAAGAAATTGTGGCTTTGTGGCTTTCATGAATAGAGCTGATGGACAAGCTGCAAAAGATGAAATGCAAGGAGTTGTTGTCTATGAATATGAGTTGAAAATTGGGTGGGGTAAATCTGTTGCTCTTCCATCACAAGCATTACCTGCTCCCCCACCTGGACACATGGCTATCAGGAGCAAGGAGGGTGGTTCTATAATCTTATCTGGTCCTTCAGGCCCACCGGTGACATCTGTTCCGAATCAGAATTCTGAACTGGTTCTTACTCCAAATGTTCCAGATATAATGGTCGCTCCACCTGAGGACAGTCATGTCCACCATGTGATTGATACAATGGCTCTTTATGTTCTTGATGGAGGATGTGCCTTTGAACAAGCTATTATGGAGAGGGGTCGTGGCAACCCTCTATTCAACTTTTTGTTTGTGCTTGGCTCAAAGGAACATACTTACTATGTCTGGAGACTATATTCTTTCGCTCAGGGTGATACTCTTCAAAGGTGGCGGACAGAGCCTTTTATTATGATAACTGGTAGTGGAAGATGGGTACCACCACCTCTGCCAACTACAAAAAGTCCGGAGCATGAAAAGGACTCCACTGCCACATATGCTGCAGGAAGAAGCAGGCGGGTGGAGCCAGAACGAACACTTACTGATCCACAAAGGGATGAATTTGAGGACATGCTACGGGCATTGACATTAGAGAGGAGTCTGATAAAGGAAGCTATGGGTTTTGCTTTGGATAATGCTGATGCTGCTGGAGAGATAGTTGAAGTTTTGACAGAGTCTTTGACACTTAAAGAAACACCTATTCCAACTAAAGTTGCAAGGCTAATGCTTGTTTCTGACATTCTTCATAATAGCAGTGCTCCTGTTAAAAATGCATCTGCATACCGCACCAAATTTGAAGCAACATTGCCTGATATAATGGAGAGCTTTAATGATTTGTACCGCAGTGTAACGGGAAGAATCACGGCCGAGGCCCTTAAGGAACGGGTTCTGAAAGTGTTGCAAGTATGGTCAGACTGGTTTCTTTTTTCAGATGCATATGTGAACGGACTGCGAGCCACTTTTCTTCGATCAGGAAACTCTGGTGTGGCCCCGTTCCATTCTATTTGTGGTGATGCACCAGAAATTGAAAAGAATACTAGTTCAGAAGACGCGGGTGATGGGATTAAGGGCAATCAAGATGCTGCTTTGGCAATGGGCAAAGGTGCAGCTATGAGGGAGCTAATGGATCTCCCTCTTGCTGAGCTGGAAAGACGTTGTAGACATAATGGATTGTCTCTTGTTGGTGGTAGAGAAATAATGGTTGCACGACTGTTAAGCCTGGAAGATGCAGAAAAGCAGAGAAGTTATGAACTAGATGATGACTTGAAGCTTGCACAAAGCCGATCAAGTTCTTGTAGATATTCTAGTGGTCAGAGAGATATAAATGCTGAAGCAGAGCCAGTGGGATTGTCTGGATGGACTCATTATGCAGACAATGAGATCCATTCACAGCGCAAAGGTTCTGTACCTCTGGCTGAAACCCTTCCAATCCCACAGCCTGAAATAAAAGCATTCTTAAAGAAAGAGAAAATCGATCCTGTTTTGCCAGCC
TCTAAATGGTCTCGAGAAGATGATGACAGTGATGATGAAGAAAAAAGAAGCACTAGGGGTCTGGGGTTGAGCTACTCGTCTTCTGGAAGTGAGAATGCTGGTGATGGTACTAGTAAGGCTGATGAATTGGAGTTTGGAACTGATGCAAGCATTCCAGCTCCATCTGAAAGTGCAATGAATGAAGAGCAGAGGCAAAAGTTGAGACGTCTGGAGGTTGCTTTGATAGAATATCGAGAATCCCTTGAGGAGCGGGGAATTAAAAGTGCTGAGGATATTGAGAGAAGGGTTGCAGCGCATCGGAAACGGCTAGAATCTGAATATGGTTTATCAGATTCTAGTGAAGATATTTCAGGAAGAAAAAGAACATCTTCAGAGAGGAGAGAAAGGCGAGATGATGCGCACGATTCTTCAAGAAAGCGGCATCGCAGTCAAAGCCGAAGTGAGAGCCCTCCACGGAAATCATCAAACAGAGACAGGGATAGAGAAAACGATTCAGTTAATGACCGGGAAAAGCACAGGGATAGAGATAGAGATAGATCTCATGATCTGGAAAGTGAAAGGGGGAGAGAGAGAGAGCGAGACCGTCGGGAAAAGAGTGGAAGCAGAGAAAGGGATGATCATGATAGGGATAGAGGCAGAGAGAGAGATAGGGATAGGAGGAGGCGAATAAAATGA |
Protein: MAAKGKESEKKEEERLKEKEKGKSRNIDNFMEELKHEQEMRERRNQEREHWRDGRHTDSSAPSSRFDELPDDFDPSGKLPGSFDDGDPQTTNLYVGNLSPKVDENFLLRTFGRFGPIASVKIMWPRTEEERRRQRNCGFVAFMNRADGQAAKDEMQGVVVYEYELKIGWGKSVALPSQALPAPPPGHMAIRSKEGGSIILSGPSGPPVTSVPNQNSELVLTPNVPDIMVAPPEDSHVHHVIDTMALYVLDGGCAFEQAIMERGRGNPLFNFLFVLGSKEHTYYVWRLYSFAQGDTLQRWRTEPFIMITGSGRWVPPPLPTTKSPEHEKDSTATYAAGRSRRVEPERTLTDPQRDEFEDMLRALTLERSLIKEAMGFALDNADAAGEIVEVLTESLTLKETPIPTKVARLMLVSDILHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVAPFHSICGDAPEIEKNTSSEDAGDGIKGNQDAALAMGKGAAMRELMDLPLAELERRCRHNGLSLVGGREIMVARLLSLEDAEKQRSYELDDDLKLAQSRSSSCRYSSGQRDINAEAEPVGLSGWTHYADNEIHSQRKGSVPLAETLPIPQPEIKAFLKKEKIDPVLPASKWSREDDDSDDEEKRSTRGLGLSYSSSGSENAGDGTSKADELEFGTDASIPAPSESAMNEEQRQKLRRLEVALIEYRESLEERGIKSAEDIERRVAAHRKRLESEYGLSDSSEDISGRKRTSSERRERRDDAHDSSRKRHRSQSRSESPPRKSSNRDRDRENDSVNDREKHRDRDRDRSHDLESERGRERERDRREKSGSRERDDHDRDRGRERDRDRRRRIK |
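
The CDS and Protein entries above are related by translation with the standard genetic code. The following minimal sketch shows that relationship on the first and last 18 nucleotides of the CDS only, not on the full sequence. The `translate` function and the compact codon-table construction are illustrative assumptions; an established library such as Biopython would normally be used for real work.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop).

    Hypothetical helper: trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 18 nt of the CDS in this record.
print(translate("ATGGCAGCCAAGGGAAAG"))  # prints: MAAKGK
# Last 18 nt of the CDS in this record, ending in the TGA stop codon.
print(translate("AGGAGGCGAATAAAATGA"))  # prints: RRRIK*
```

Both fragments match the start (`MAAKGK`) and end (`RRRIK`) of the Protein sequence listed above.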